Genome-wide identification and characterization of superoxide dismutases in four oyster species reveals functional differentiation in response to biotic and abiotic stress

Background Oysters inhabit in the intertidal zone and may be suffered from environmental stresses, which can increase the production of reactive oxygen species (ROS), resulting in mass mortality. Superoxide dismutases (SODs) protect oysters from ROS damage through different mechanisms compared with vertebrates. However, the molecular and functional differentiation in oyster SODs were rarely analyzed. Result In this study, a total of 13, 13, 10, and 8 candidate SODs were identified in the genome of Crassostrea gigas, Crassostrea virginica, Crassostrea hongkongensis, and Saccostrea glomerata respectively. The domain composition, gene structure, subcellular locations, conserved ligands, and cis-elements elucidated the SODs into five groups (Mn-SODs, Cu-only-SODs, Cu/Zn ion ligand Cu/Zn-SOD with enzyme activity, Zn-only-SODs, and no ligand metal ions Cu/Zn-SODs). For single domain Cu/Zn-SODs, only one cytosolic Cu/Zn-SOD (cg_XM_034479061.1) may conserve enzymatic activity while most extracellular Cu/Zn-SOD proteins appeared to lose SOD enzyme activity according to conserved ligand amino acid analysis and expression pattern under biotic and abiotic stress in C. gigas. Further, multi-domain-SODs were identified and some of them were expressed in response to biotic and abiotic stressors in C. gigas. Moreover, the expression patterns of these genes varied in response to different stressors, which may be due to the cis-elements in the gene promoter. Conclusion These findings revealed the most extracellular Cu/Zn-SOD proteins appeared to lose SOD enzyme activity in oysters. Further, our study revealed that only one cytosolic Cu/Zn-SOD (cg_XM_034479061.1) may conserve enzymatic activity of SOD. Moreover, the expression patterns of these genes varied in response to different stressors, which may be due to the cis-elements in the promoter. This study provides important insights into the mechanisms through which oysters adapt to harsh intertidal conditions, as well as potential biomarkers of stress response in related species. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08610-9.


Background
Superoxide dismutases constitute a group of enzymes that catalyze the conversion of superoxide anion (O 2− ), a hallmark of reactive oxygen species (ROSs), into Open Access *Correspondence: linzhihua@zwu.edu.cn; qxue@zwu.edu.cn hydrogen peroxide (H 2 O 2 ) and oxygen molecules using a metal cofactor as the catalyst [1][2][3]. These biochemically similar metalloenzymes are diverse at the genetic and molecular levels. They are categorized into three classes, Cu/Zn-SOD, Fe-SOD/Mn-SOD, and Ni-SOD, and use Cu 2+ , Fe 2+ or Mn 2+ , and Ni 2+ as catalytic metal cofactors, respectively [4,5]. These three classes of SOD likely evolved from three different ancestral genes, giving rise to three distinct protein families with differences in amino acid sequences and molecular conformations. Fe-SOD and Mn-SOD are believed to be phylogenetically related and are thus classified into the same SOD family [6]. Different SODs also vary in their distribution in organism domains and two classes of SOD (Mn-SOD and Cu/Zn-SOD) have been identified in eukaryotes [7,8]. Additionally, the increased speed of evolution of the Cu/Zn-SODs in the past 100 million years may have further contributed to the molecular variation of the SOD class. In animals, two subclasses of Cu/Zn-SODs, the cytoplasmic Cu/Zn-SOD and the extracellular Cu/Zn-SOD, presumably evolved following independent evolutionary paths and are believed to function the cytosol and mitochondria and in the extracellular milieu, respectively [9][10][11][12].
Unlike most vertebrates, where a single molecular form for each of the Mn-SOD, cytosolic Cu/Zn-SOD, and extracellular Cu/Zn-SOD has been identified, some invertebrates appear to have much more complex SOD repertoire [4,12]. In bivalves, research at the genetic and genomic levels has evidenced the expansion of SOD genes, with identified genes in a species varying from 4 to 10 [13][14][15][16]. In addition to their diversity in gene numbers, bivalve SOD genes, particularly those of the Cu/Zn-SOD family, show high variations in the structure and the coded molecules. For example, the glycoprotein pernin isolated from the plasma of the New Zealand green-lipped mussel Perna canaliculus and SOD4, a potential protein predicted from the scallop Chlamys farreri, contain 3 and 4 Cu/Zn-SOD domains, respectively [14]. Moreover, BLAST searchbased analysis identified a gene in the Pacific oyster C. gigas genome that likely encodes a copper-only SODrepeat protein (CSRP), which represents a special form of the newly discovered Cu-only SODs [17]. Moreover, the purification and characterization of the most abundant protein in the plasma of 2 oyster species (i.e., Cg-EcSOD or cavortin for C. gigas and dominin for the eastern oyster Crassostrea virginica) as a homolog of extracellular Cu/Zn-SOD further illustrates the molecular diversity of SODs in bivalves [18][19][20] Therefore, the molecular diversity of the SOD families and their related evolutionary mechanisms are far from clear for bivalve mollusks.
Living cells produce ROSs during aerobic metabolism [12]. Under physiological conditions, some of these oxidative radicals are pivotal in ROS signaling [1,21]. However, when organisms are under environmental stress (e.g., pollution, pathogen invasion, abrupt changes in physicochemical conditions), excessive production of ROSs results in oxidative stress, leading to lipid, protein, and DNA damage in the host cells [1]. SODs play an important role in regulating ROS signaling and function as the first line of defense against oxidative damage [3]. The link between SOD malfunction including molecular mutations and a variety of pathological conditions in plants, animals and humans reflects the significance of these proteins for the maintenance of redox homeostasis in different organisms [22][23][24]. Moreover, recent findings have indicated that the cytosolic SOD1 of yeasts relocates to the nucleus to function as a transcription factor and regulate ROS signaling-related genes. Additionally, SOD1 activates a transcription factor involved in the response to copper deficiency. Collectively, these findings provide preliminary insights into the function of SOD1; however, its mechanisms of action yet to be elucidated [25,26].
Bivalves often face a wide variety of biotic and abiotic stressors that readily induce oxidative stress owing to the turbulent living environments in estuarine and coastal areas and their filter-feeding lifestyle [14,[27][28][29]. An increasing body of evidence indicates that the expression of SOD genes changes during stress responses in bivalves such as oysters [15], mussels [30], and clams [31,32]. However, some homologous proteins of SODs (e.g., C. gigas Cg-EcSOD/cavortin and C. virginica dominin) do not possess SOD enzyme activity and are thus unlikely to participate in the host antioxidant defense as a direct ROS scavenger [19,20]. Instead, recent studies have suggested their involvement in host immunity [18,33] and in metal transportation and storage related processes such as biomineralization [20,34]. Additionally, the functions and associated mechanisms of action of the multi-SOD domain containing CSRPs remain unknown. Therefore, the functions and regulatory mechanisms of SODs and their homologous proteins in oysters and possibly bivalve mollusks in general are likely more diverse than currently thought, and therefore additional studies are required to bridge these knowledge gaps.
Here, we identified and characterized the SODs of four oyster species (C. gigas, C. virginica, C. hongkongensis, and S. glomerata). Further, genome mapping was conducted, and the gene structure of SODs was characterized in the four aforementioned oyster species. The evolutionary relationship of SODs with other vertebrates and bivalves was also evaluated. Additionally, conserved motif and protein models were also constructed to identify their function. Finally, the cis-element in the promoter and the expression pattern of these SODs under abiotic and biotic stress were used to analyze stress tolerance.

Physicochemical characterisitics of predicted oyster sod family members
A total of 44 candidate SOD family genes, 13 in each of C. giga and C. virginica, 10 in C. hongkongensis, and 8 in S. glomerata, were identified in the 4 oyster species analyzed in the present research ( The proteins encoded by the identified oyster SOD family genes varied greatly in molecular characteristics ( Table 1). The 4 Mn-SOD homologs were predicted to have a polypeptide length varying from 224 to 256 aa, with a calculated molecular weight of 24.9-29.0 kDa and an isoelectric point (pI) of 6.49-6.66. The single domain Cu/Zn-SOD family members ranged from 100 aa to 545 aa long, with a calculated molecular weight of 10.3-55.9 kDa and a pI of 4.73-9.19, except one large molecular protein encoded by CH_C08G01508. The 2 and 4 Sod_Cu domain molecules were significantly larger than the 1 domain containing ones, with the 2 twodomain proteins having a molecular weight of 38.4 kDa and 52.4 kDa and the 7 four-domain proteins being 100.6-110.8 kDa.

Genome mapping, gene duplication and Synteny analysis of SOD family members
TBtools mapped the SOD family genes on 6 chromosomes in C. gigas and C. virginica, 5 chromosomes in C. hongkongensis genes, and 6 scaffolds in S. glomerata ( Fig. 1). Some Cu/Zn-SOD genes were present in a cluster on the same chromosome, with a chromosome of C. gigas and C. hongkongensis containing 3 tandem duplicated genes and a chromosome of C. virginica containing 4 tandem duplicated genes (Additional file 1: Table S1). The structural analysis predicted 4 exons in the Mn-SOD gene of C. gigas and C. virginica, and S. glomerata and 5 exons in that of C. hongkongensis. In contrast, the predicted exon numbers of the Cu/Zn-SOD family genes varied between 4 and 11. The synteny result revealed that those SODs were in 6 collinear regions in four oyster species ( Fig. 2 and Additional file 2: Fig. S1). The genes in the collinear region showed high similarity in four species. The Cu/Zn-SOD genes (cg_XM_034479061.1, CH_C06G00439, and Sgl021124) have more than 70% similarity with human SOD1 and in the same collinear region (Additional file 3: Table S2). The collinear analysis also showed that the Mn-SODs were conserved in four oyster species. In addition, the tandem duplicated genes of three oysters were in the collinear regions among four species.

Phylogenetic tree, sequence analysis and enzyme activity prediction
A phylogenetic tree was constructed with the amino acid sequences of SOD family proteins from the 4 oyster species and that from the 5 model animals (Homo sapiens, Mus musculus, Danio rerio, Drosophila melanogaster, and Caenorhabditis elegans) (Fig. 3). The molecules were grouped into 6 clades according to the phylogenetic and sequence analysis. Clade I, shadowed in light blue, encompassed 7 four-domain and 1 two-domain Cu/Zn-SOD homologs and all the coded proteins were predicted to distribute extracellular (Additional file 4: Table S3). Clade II, shadowed in watermelon red, included 9 single domain Cu/Zn-SODs that were predicted to distribute in cytoplasm (cyto-Cu/Zn-SODs) and 11 in extracellular milieu (ec-Cu/Zn-SODs). Clade III, shadowed in orange, included 4 cyto-Cu/Zn-SODs (95% bootstrap value), had different predicted protein model with other clades. Clade IV, shadowed in yellow, included the cyto-and ec-Cu/Zn-SODs. Clade V (86% bootstrap value), shadowed in green, Cu/Zn-SODs also included the cyto-and ec-members. Clade VI (100% bootstrap value), shadowed in light purple, included all the Mn-SOD molecules.
The MEME server predicted 10 conserved motifs (numbered 1 to 10) in the Cu/Zn-SOD sequences (Additional file 5: Table S4) and 3 motifs in the Mn-SOD sequences. Motif 1 (DHNHGLQIHEYGDMEHGCDTIGE-LYHNEH) contained six of the Cu/Zn-SOD conserved ligand site region and motif 2 contained the fourth Cu ligand histidine. Motif 1 and 2 also corresponded to the domain Sod_Cu (PF00080) and represented the Cu/Zn-SOD superfamily. In addition, motifs 8, 7, 2 contained two, one, one of the four Cu ligands, respectively. In Clade I, each Cu/Zn-SOD domains contained the combination motif 8-7-5-2. In Clade II, two kinds of combination of motif 1-2 and 1-3-2 represented the Cu/Zn-SOD domain. In Clade III, the 1-2 motif combination was identified. In Clade IV and Clade V, the common motif combination was 1-3-2. In addition, the 1-3-2 motif was the most frequent among all combinations in all the Cu/Zn-SODs. In Clade VI, the motif 11 to 13 covered the sodA#PF00081. All the oysters' sequences were similar in the predicted motif patterns to the family representative molecule from the model species.
Amino acid sequence alignments of the oyster SOD homologs with that of the representative molecules from model species identified variable number of conserved amino acid residues that assumingly served as metal ion ligands (CLs) (Fig. 3 conserved ligands). For family Cu/Zn-SODs, 7 residues are determined to serve as the CLs for the ions Cu and Zn in the enzyme active center and thus be essential for the SOD enzyme activity in known Cu/Zn-SODs [4,8,9]. In the 4-domain sequences in Clade I, each domain possessed 3-6 CLs (Additional file 6: Table S5). The sequences in Clade II had 6 (b and c) or 7 (a) CLs. The polypeptides encoded by cg_ XM_034479061.1, cg_XM_011436744.3, CH_C06G00439, cv_XM_022460203.1, Sgl021124, and Sgl011058 in this clade contained all the 4 Cu CLs and 2 or 3 of the Zn CLs. The CL numbers in the sequences grouped in Clades III-V varied between 2 and 4, with the partial loss of ligands for both Cu and Zn. For the Mn-SOD homologous sequences in Clade VI, all the 8 CLs, His-His-Asp-His for Mn 2+ linkage and His-Tyr-Gln for maintaining the active center confirmation [5], were conserved.
Structure modeling of all the oyster SOD homologous proteins with Phyre2 identified 7 most accurate structure models corresponding to the predicted molecules different in motif and CL compositions ( Fig. 3 predicted protein models a-g, Additional file 7: Table S6 by phyre2, Additional file 8: Table S7 by SWISS-MODEL). The structural model developed on the polypeptides in Clade I showed a SOD active center that contained a copper ion, but no zinc ion (model a). The predicted structure of proteins in Clade II involved 3 models that showed an active center containing a copper ion (model a), both copper and zinc ions (model b), or a zinc ion (model c). The proteins in Clades III, IV and V were developed into another 3 models (models d, e, f ) that all lacked a metal ion containing active center. The model a-f also had a different number of α helix and coil/β-turns while they shared some structural elements in particular the eight β strands. The existed of eight β strands may contribute to the similar structure between models a-f. The CLs meanly existed on the α helix or coil/β-turns, may cause the loss of ligand metals. However, the structure model of the proteins in Clade VI developed based on 2 known Fe/ Mn-SODs (Fe-SOD and Mn-SOD are same protein only varied in ligand ions) differed significantly in confirmation from the other with the other models and showed an active center that contained a manganese ion (model g).

Cis-regulatory elements in the regulatory region of sod genes
A total of 105 transcription factors (TFs) involving 25 families as the cis-elements were identified in the promoter of SODs in the Pacific oyster ( Fig. 4 and Additional file 9: Table S8). Androgen receptor represented the most frequently identified TF in all the Pacific oyster SOD genes. Approximately 30% of the TFs involving 14 TF families were unique to the oyster SODs. Among them, the family Forkhead (FOX) was the most frequently identified. These unique TFs were present in the Mn-SOD gene (cg_NM_001308918.1), and some Cu/Zn-SOD family genes (Clade I, cg_XM_011416304.3; The numbers of cis-element identified in individual genes varied between 2 and 6 (Additional file 10: Table S9). The promoter of cg_NM_001308918.1(Mn-SOD) and cg_XM_011416304.3, for example, contained 6 cis-elements, while cg_XM_011430838.3 only had 2 cis-elements. Most genes were identified to have 3-5 ciselements in their promoter.

SOD expression profiles in development stage and adult pacific oyster tissues
During embryonic and larval development, the expression patterns varied among SOD genes (Fig. 5a). A set of genes, including two cyto-Cu/Zn-SODs (seven CLs, cg_XM_034479061.1, two CLs, cg_XM_011448476.3), a multi-domain-SOD (cg_XM_011416304.3), and Mn-SOD, were detected in all developmental stages. The expression level of two ec-Cu/Zn-SODs that contained 6 and 7 CLs exhibited an increasing trend from the gastrula stage. A cyto-Cu/Zn-SOD and three ec-Cu/Zn-SOD that contained three CLs expressed an extremely high level during the juvenile stage but almost not expressed in other stages (Fig. 5a). Additionally, those four genes were the most highly expressed among all SOD genes in adult tissues (Fig. 5b). The cyto-Cu/ Zn-SOD containing seven CLs and Mn-SOD showed  higher mRNA level in all tissues and with similar expression levels among different tissues.

Expression pattern of SOD genes under abiotic and biotic stress
We further investigated the transcriptional profiles of SOD genes in adult Pacific oysters subjected to abiotic and biotic stress. The genes in Clade V (cg_XM_011416093.3, cg_XM_011416094.3, cg_XM_011441495.3, and cg_NM_001308888.2) were highly expressed and up-regulated at 25 °C under different levels of temperature stress (Fig. 6a,b). The Mn-SOD and cyto-Cu/Zn-SOD that contained seven and two CLs (cg_XM_034479061.1 and cg_ XM_011416094.3) were up-regulated under air exposure stress (Fig. 6c,d). Under low salinity (salinity 5, 10 psu), six SODs were down-regulated, whereas the cyto-Cu/Zn-SOD that contained seven CLs and Mn-SOD were up-regulated in eight responsive genes (Fig. 6e,f). In response to biotic stress, six genes (including four SODs in Clade V, one multi-domain-SOD in Clade I (cg_ XM_011416304. 3), and one cyto-Cu/Zn-SOD in Clade II contained seven CLs) exhibited significant changes (P ≤ 0.05) with temporal activation (Fig. 7), whereas the mRNA level of other genes showed no differences (P > 0.05) during infection. The four SODs in Clade V exhibited the same trends, thus demonstrating that these genes were down-regulated in response to infection, while the multi-domain-SOD (cg_XM_011416304.3) and cyto-Cu/Zn-SOD containing seven CLs were up-regulated during Vibrio infection. "I" to "VI" represented the six clades. Circles, triangles, and no mark near the tree represented the cytosolic, mitochondrial, and extracellular sub-cellular of SODs, respectively. The proteins in seven colors represented the protein models predicted by phyre2. In the Domain/motif part, colored cylinders (green and pink) represent the Cu/Zn-SOD and Mn-SOD domains. The rectangles numbered one to thirteen represent motifs. The Conserved Ligands part displayed the most important ligands for enzyme activity of SODs. The "i", "ii", "iii", and "iv"in both second and third part were the numeric of multi-domain Cu/Zn-SOD (clade I). "a" to "g" in the third and fourth part represented the corresponding models of encoded proteins. In the protein model part, the models were predicted by SWISS-MODEL based on the model predicted in phyre2, so, the same color of the letters in the first, third, and fourth part were same. The orange sphere in "a" and "b" was the copper ion, the blue sphere in "b" and "c" was zinc ion, the purple sphere in "g" was manganese ion

Discussion
Our findings indicated that the SOD family genes expanded in four oyster species (C. gigas, C. virginica, C. hongkongensis, and S. glomerata) compared with those of vertebrates [35,36]. ec-SODs represent the majority of Cu/Zn-SODs, which is consistent with that the ec-SOD evolutionary history is longer than that of cytoplasmic Cu/Zn-SODs [9]. The SODs were clustered into six clades according to the phylogenetic analysis. The Mn-SODs of the four oyster species had two kinds of protein models, which were predicted to be different chains of iron-SOD [37]. Our results thus confirmed that Mn-SODs are much conserved than Cu/Zn-SODs [38].
For the Cu/Zn-SODs, the multi-domain-SODs and single-domain SODs were first separated. The predicted protein model of multi-domain-SODs in clade I was heterodimers of the mutant SOD enzyme, which had been reported in previous studies. In Chlamys farreri, one multi-domain-SOD was identified, whereas two to three genes were identified in the three Crassostrea oysters [14]. Multi-domain-SODs have only been identified in several aquatic organisms and winged insects [17], however, their function is not characterized yet. In the present study, we found that the predicted protein model showed they were Cu-only SODs and might have the SOD enzyme activity. In the expression pattern study, a multi-domain-SOD (cg_XM_011430838.3) expressed lowest level in different tissues and under abiotic and biotic stress. Another two multi-domain SODs (cg_XM_011416304.3 and cg_XM_034474372.1) were down-regulated under abiotic and biotic stress (Figs. 5 and 6). The mRNA expression of Cu-only SOD (multidomain-SOD in this study) in the Pacific oyster decreased significantly in gills exposed to Cu but increased under Pb exposure [39]. The function of multi-domain-SODs in fungi was related to defense host-derived oxidative stress [40]. The function of oyster multi-domain-SODs is still unclear, and therefore their functions and expression patterns should be further characterized.
Except for the Mn-SOD and multi-domain-SODs, single domain SODs were separated into four subgroups according to the number of CLs. Cyto-Cu/Zn-SODs contained seven CLs, among which cg_XM_034479061.1 and CH_C06G00439 were predicted to have the same structure as human SOD1 [41] and were more likely to possess enzyme activity [9]. The protein structures of ec-Cu/Zn-SODs, which contained two CLs (clade V), had been purified and identified from C. virginica [20] and C. gigas [19]. The studies revealed that these ec-Cu/Zn-SODs (major plasma proteins) possessed the features of the Cu/Zn-SOD domain but lacked SOD enzyme activity [20]. Thus, the cyto-Cu/Zn-SODs that contained two CLs (cg_XM_034462555.1, and cg_XM_011448476.3) might lack SOD enzyme activity similar to the ec-Cu/ Zn-SODs that contained two CLs (cg_XM_011441495.3,  cg_NM_001308888.2, and cg_XM_011416093.3). In the present study, the recombinant protein of cg_ XM_034479061.1 was proved to have SOD activity, and the loss of Cu CLs in cg_XM_034479061.1 caused the SOD activity could not be detected [42]. The result proved the importance of Cu ligands. The predicted cyto-Cu/Zn-SOD (cg_XM_011416094.3) was a ec-Cu/Zn-SOD in the previous study and lack SOD enzyme activity [43]. Although the SOD1 mutant of the CLs might cause enzyme activity loss and gain other kinds of function in humans [24,44], Cu/Zn-SODs with no enzyme activity may participate in the oxidation defense by transporting metal [20,34,39,43]. Additionally, those oyster cyto-Cu/ Zn-SODs contained three or two CLs were predicted as the heterodimer between the copper chaperone for SOD and SOD1, which may facilitate or regulate copper delivery [45]. Thus, the SODs that contained two CLs might provide different protective functions compared with SOD1 in humans. The cyto-Cu/Zn-SOD that contained seven CLs was up-regulated, whereas the ec-Cu/ Zn-SOD that contained six CLs (all Cu CLs remained) cg_XM_011436744.3 was not significantly up-regulated in response to biotic and abiotic stress. Those result suggested that the only single domain Cu-only gene (cg_XM_011436744.3) might be a pseudogene or nearly no function by no expression in the adult organisms. All of the results revealed that the cyto-Cu/Zn-SOD with seven CLs (cg_XM_034479061.1) might be the only cyto-Cu/Zn-SOD with SOD enzyme activity in C. gigas. For other ec-Cu/Zn-SODs in C. gigas, 4 (Clade V) in 6 were proved with no SOD enzyme activity [20,42]. Another ec-Cu/Zn-SOD (cg_XM_011436744.3, Clade II,) was predicted to lost 1 of 4 Cu ligands, might lack the SOD activity according to previous study [42,46]. Thus, except for the multi-domain Cu/Zn-SODs, there might be no or only one (cg_XM_011436744.3) ec-Cu/Zn-SOD play the role with SOD activity in C. gigas. Although all the Cu/Zn-SOD possessed the SOD-domain, variable functions have been identified in previous studies [20,34,40], and therefore the presence of domains is not enough to indicate the function of genes, the conservation of key functional sites and other aspects to facilitate need to be considered in future analyses. Altogether, our findings indicate that SODs in oysters may be categorized into five groups according to their functional diversity (Mn-SOD enzyme, Cu-only-SODs, Cu/Zn ion ligand Cu/Zn-SOD with enzyme activity, Zn-only-SODs, and no ligand metal ions Cu/Zn-SOD); however, the relationship between different functions requires further characterization.
We further studied the cis-element and expression patterns of C. gigas SODs due to their wide variations. The androgen receptor (AR) was present in most SODs. It had widespread expression in many cells and tissues, and the transcriptional regulatory effects of the AR included pathways involved in cell growth and proliferation, cell cycle progression, protein synthesis, and cell death [47]. Estrogen receptor 1 (ESR1) was found in the promoter of six Cu/Zn-SODs. AR and ESR1 are members of the steroid hormone nuclear receptor family, which was involved in a wide variety of physiological functions, including control of embryonic development, cell differentiation, and homeostasis [48]. The chromosome remodeling factor family members are key components involved in DNA replication and include histone chaperones, histone-modifying enzymes, and ATP-dependent chromatin remodeling complexes. The ec-Cu/Zn-SOD (cg_ NM_001308888.2) had the most TFs compared with the other two ec-Cu/Zn-SODs in clade V. All of these ec-Cu/ Zn-SODs were not up-regulated at low or high temperatures (35 °C). These ec-Cu/Zn-SODs were distributed in clusters on one chromosome, which was consistent with a previous study that the Pacific oyster was enriched with duplication regions [49]. The duplication of genes might still share transcriptional characteristics as demonstrated by the expression patterns under abiotic and biotic stress. The similar expression profile might due to the preservation of the promoter of duplicated genes [50]. A previous study showed that the Cu/Zn-SODs that contained two CLs in Clade V (cg_XM_011416093.3, cg_XM_011416094.3, cg_XM_011441495.3, and cg_ NM_001308888.2) were up-regulated in response to metal-induced stress, suggesting that they might participate in metal detoxification [39]. In the absence of CLs, oxidative defense mechanisms may lose, for instance, human mutant SOD1 lacked the Cu or Zn ligand caused the SOD1 malfunction, which leads to loss of SOD activity [23,41]. The genes in the same clade shared some similar TFs, that is critical for their response to the same condition. In addition, unique TFs of genes in Clade V may contribute to their metal detoxification (sub-functionalization process) [50,51]. Therefore, more studies are needed to analyze the diversity of the cis-elements in promoters to gain insights into the functional evolution of genes. The diversity of cis-elements in the promoter revealed the potential regulatory relationship between different cell processes [52]. Our work lays the foundation for future comparative studies to elucidate the functional diversity of SOD family and underlying mechanisms.

Conclusions
SOD genes were analyzed in the genomes of four oyster species. Cu/Zn-SOD exhibited a wide diversity of gene family numbers and sub-cellular localization, chromosomal distribution, gene structure, and conserved ligand amino acids (CLs), whereas these features were highly conserved for Mn-SOD. Most extracellular Cu/Zn-SOD proteins appeared to lose SOD enzyme activity in oysters due to the lack of CLs despite the occurrence of more than one gene and higher levels of gene expression. Therefore, the functions of this kind of Cu/Zn-SOD must be further characterized. Multi-domain-SODs are widely distributed in oysters and some of them responded to biotic and abiotic stressors in the Pacific oyster. Further, our study revealed that only one cytosolic Cu/Zn-SOD (cg_XM_034479061.1) may conserve enzymatic activity of SOD. Moreover, the expression patterns of these genes varied in response to different stressors, which may be due to the cis-elements in the promoter. Our results clarified the gene family members and potential functions of SODs in oysters, as well as the unusual diversity of SODs. These findings provide important insights into the mechanisms by which oysters adapt to harsh intertidal conditions and allows for the identification of precise biomarkers of stress response in oysters and other invertebrates.

Identification, sequence and structure analyses of SOD family members
The initial chromosome/scaffold sequences were downloaded from a public online database, including C. gigas (assembly cgigas_uk_roslin_v1, GCA_902806645.1) [53] and GCA_002022765.4 C_virginica-3.0 on NCBI, C. hongkongensis genome [54] at the China National Gen-Bank (CNGB, https:// db. cngb. org/, No. CNA0003280), as well as the Sydney rock oyster S. glomerata genome online database (http:// soft. bioin fo-minzh ao. org/ srog/) [16]. Four oyster genome files were used for the identification of SOD genes. The "cg_" and "cv_" tags were appended before the ID of these SOD genes in the published genome of C. gigas and C. virginica to distinguish these two species. Further, the IDs of C. hongkongensis SODs start with "CH" and those of S. glomerata start with "Sgl" in this study. All candidate SOD sequences with a significant BLAST hit (E-value ≤1.00 × 10 − 10 ) and SOD domain (PF00080, PF00081, and PF02777) hit (E-value ≤1.00 × 10 − 10 ) were obtained using the BLAST software and HMMER (http:// www. ebi. ac. uk/ Tools/ hmmer/ search/ phmmer). The presence of the SOD domain was further verified using the SMART (http:// smart. emblheide lberg. de/) web tools. The genes with partial coding sequences were filtered in the following study.

Chromosomal localization, gene duplication and Synteny analysis of SOD family members
The chromosomal positions of SODs were visualized by the TBtools software [61]. The SOD gene duplication was confirmed according to two criteria: (1) the coverage of aligned sequence was more than 70% of longer sequence, and (2) the similarity of the two aligned sequences were more than 70% [62]. Duplicated genes separated by five or fewer genes in 100-kb chromosome fragment were considered as tandem duplication genes. The synteny of SODs between four species were calculated by MCscanX method and visualized with TBtools [61].

Cis-acting elements in the promoter regions
To identify cis-regulatory elements in the SOD family genes, the 2000 bp upstream of the translation start site were retrieved from the genome of the Pacific oyster using TBtools [61]. The elements were then predicted using the AnimalTFDB 3.0 online software (http:// bioin fo. life. hust. edu. cn/ Anima lTFDB/) [68]. The candidate transcription factors (TFs) with q-value < 10 − 5 were selected.

Expression profiles of sods in developing stages and adult tissues of pacific oysters
The expression profiles of SOD genes in different developmental stages and adult tissues of Pacific oysters were analyzed using RNA-seq data [15]. The sampled developmental stages included eggs, two/four cells cell-stage embryos, early morula, morula, blastula, gastrula, trochophore, D-shape larvae, early-umbo larvae, late-term umbo larvae, pediveliger and juvenile. Collected soft tissues of adults included the mantle, gill, adductor muscle, digestive gland, hemocyte, and gonads. All samples were frozen in liquid nitrogen. The RNA-seq raw data of developing stages numbered from SRR334222 to SRR334259 and the adult tissue data numbered from SRR334212 to SRR334220 in the NCBI SRA database were downloaded. The generated RNA-seq reads were mapped to the C. gigas genome using Hisat2 (V2.1.0) [69]. The expression of all SOD genes was normalized and reported as fragments per kilobase of exon model per million mapped (FPKMs).
For the biotic stress experiments, the acclimated oysters were injected with 100 μL of Vibrio mediterranei isolated from moribund Sinonovacula constricta, after which digestive gland tissues were sampled at 1, 3, 6, 12, 24 h post-injection. Tissue samples were frozen in liquid nitrogen and stored at − 80 °C until RNA extraction. RNA was extracted using the Trizol reagent. The extracted RNA samples were sequenced on the Illumina HiSeq 2000 platform. The expression level of all genes was quantified using Salmon v1.2.1 [70] and reported as transcripts per kilobase of exon model per million mapped reads (TPM).

Statistical analyses
Statistical analyses were performed using the SPSS 22.0 software (IBM; NY, USA). One-way analysis of variance (ANOVA) followed by Duncan's test was used to evaluate the expression levels of SOD genes in response to Vibrio stress. The results were considered significant at P < 0.05.